Learning Accurate Integer Transformer Machine-Translation Models

نویسندگان

چکیده

We describe a method for training accurate Transformer machine-translation models to run inference using 8-bit integer (INT8) hardware matrix multipliers, as opposed the more costly single-precision floating-point (FP32) hardware. Unlike previous work, which converted only 85 multiplications INT8, leaving 48 out of 133 them in FP32 because unacceptable accuracy loss, we convert all INT8 without compromising accuracy. Tested on newstest2014 English-to-German translation task, our Base and Big yield BLEU scores that are 99.3–100% relative those corresponding models. Our approach converts matrix-multiplication tensors from an existing model into by automatically making range-precision trade-offs during training. To demonstrate robustness this approach, also include results INT6

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Weighted Transformer Network for Machine Translation

State-of-the-art results on neural machine translation often use attentional sequence-to-sequence models with some form of convolution or recursion. Vaswani et al. (2017) propose a new architecture that avoids recurrence and convolution completely. Instead, it uses only self-attention and feed-forward layers. While the proposed architecture achieves state-of-the-art results on several machine t...

متن کامل

Steps Toward Accurate Machine Translation

A novel multi-phase architecture for an accurate machine translation system is proposed. The system is divided into three phases: quick and dirty machine translation (QDMT), conceptual comparison and repair and iterate. QDMT generates the appropriate translation candidates (TCs) in the target language for the input sentence in the source language. Next, the system compares the meaning of the TC...

متن کامل

Machine Learning for Integer Programming

Mixed Integer Programs (MIP) are solved exactly by tree-based branch-and-bound search. However, various components of the algorithm involve making decisions that are currently addressed heuristically. Instead, I propose to use machine learning (ML) approaches such as supervised ranking and multi-armed bandits to make better-informed, input-specific decisions during MIP branch-andbound. My thesi...

متن کامل

Measure Transformer Semantics for Bayesian Machine Learning

The Bayesian approach to machine learning amounts to computing posterior distributions of random variables from a probabilistic model of how the variables are related (that is, a prior distribution) and a set of observations of variables. There is a trend in machine learning towards expressing Bayesian models as probabilistic programs. As a foundation for this kind of programming, we propose a ...

متن کامل

Interactive Machine Translation using Hierarchical Translation Models

Current automatic machine translation systems are not able to generate error-free translations and human intervention is often required to correct their output. Alternatively, an interactive framework that integrates the human knowledge into the translation process has been presented in previous works. Here, we describe a new interactive machine translation approach that is able to work with ph...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: SN computer science

سال: 2021

ISSN: ['2661-8907', '2662-995X']

DOI: https://doi.org/10.1007/s42979-021-00688-4